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Abstract 

We present a review of the properties of generalized domain wall Fcrmions, based on 
a (real) Mobius transformation on the Wilson overlap kernel, discussing their algorithmic 
efficiency, the degree of explicit chiral violations measured by the residual mass (m res ) and 
the Ward-Takahashi identities. The Mobius class interpolates between Shamir's domain wall 
operator and Borici's domain wall implementation of Neuberger's overlap operator without 
increasing the number of Dirac applications per conjugate gradient iteration. A new scaling 
parameter (a) reduces chiral violations at finite fifth dimension (L s ) but yields exactly the 
same overlap action in the limit L s — > oo. Through the use of 4d Red/Black preconditioning 
and optimal tuning for the scaling a(L s ), we show that chiral symmetry violations are 
typically reduced by an order of magnitude at fixed L s . At large L s we argue that the 
observed scaling for m res = 0(1/ L s ) for Shamir is replaced by m res — 0(1/ L 2 ) for the 
properly tuned Mobius algorithm with a — 0(L S ). 



1 Introduction 



Perhaps the most important theoretical development in lattice field theory at the end of the 
last century was the discovery of Fermion actions that respect exact chiral symmetry at finite 
lattice spacing. The consequence of this is a clean separation between chiral symmetry breaking 
(i.e. non-zero quark masses) and Lorentz breaking (i.e. non-zero lattice spacing). Even at finite 
lattice spacing, this allows for a rigorous understanding of topology and greatly simplifies the 
numerical extrapolations to obtain renormalized correlation functions in the continuum limit 
with the light quarks masses at their physical value. 

The key idea in evading the Nielsen- Ninomiya no-go theorem [TJ [2] , which forbade the 
construction of lattice Fermion action with chiral symmetry under rather general conditions, 
was introduced by Kaplan [3]. In his construction four dimensional chiral zero modes appeared 
as bound states on a mass defect or 3-brane in a five dimensional theory. Much like the work 
of Callan and Harvey [4j in the continuum, anomalous currents in the 4 dimensional theory 
are understood as the flow on or off the mass defect of conserved 5 dimensional currents. This 
work led to two concrete realizations of lattice Fermions with chiral symmetry: the domain wall 
Fermions (5j [6] and the overlap Fermions [TJ El EJ \W\ ITT] . The domain wall formulation retains 
the five dimensional nature of the original idea, while the overlap is a direct construction of a 
four dimensional effective action. It was subsequently realized, that the 4d effective Fermion 
operator, D ov (m), of the domain wall action also satisfies the so called Ginsparg- Wilson (GW) 
relation [12] , 

75^(0) + £C(0)75 = 2i? 7 5 , (1-1) 

where D ov (0) is the massless Dirac operator and R = 0{a) is a local operator. Taking R = 1 
in lattice units, a solution to the GW relation is D ov (0) = \ + ^^[H^] where is the Dirac 
Hamiltonian for the 5-th dimension. Clearly the on-shell zero mass Fermion action is chiral: 
{75, ^"^(O)} ~ 0. The Ginsparg- Wilson relation was discovered in the context of an exact 
renormalization group improved Fermion actions in the free field limit [12J. More than a decade 
later, it was realized |13j , that the classical (or so called perfect action |14} 115]) approximation 
to the renormalization group for the Fermion actions in a fixed gauge background still satisfies 
the GW relation. The Ginsparg- Wilson relation implies a modified lattice form of infinitesimal 
chiral rotations and it can be viewed as an alternative explanation of how the conditions of the 
Nielsen- Ninomiya theorem are circumvented [16J. 

Ultimately the domain wall and overlap Fermions are equivalent, requiring numerical 
algorithms which implement an approximation to the GW relations. In the domain wall formu- 
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lation, this approximation is a consequence of the finite extent of the fifth dimension, L s < oo, 
whereas for the overlap algorithm a finite rational approximation to the sign function, eL s [H^\. 
In both cases the resulting lattice actions have small chiral symmetry violations parameterized 



by the defect in the GW relation (see Eq. 2.5 below), which in principle can be reduced as 



needed to levels that do not seriously affect the physics. 

The Mdbius generalization for domain wall Fermions was introduced 7 years ago \17\ 
[TB] but is just now beginning to make its way into large scale simulations. Not only has this 
improvement not been fully exploited, there are new opportunities to leverage this algorithm with 
subsequent developments such as Gap Fermion [T3] and as a preconditioner as, for example, in the 
MADWF algorithm [20] using Pochinsky's MDWF inverter code [2Tf or multigrid methods [22"1 
[23"1 IM] . Consequently it is timely to give a review of some of the basic formalism that leads 
to the Mobius generalization of domain wall Fermions. In addition, the Mobius generalization 
calls for a more careful arrangement of the standard analysis of domain wall Fermion properties. 
While very little of this formalism is entirely original [3 El UHl HH [25l EH [271 US], we seek 
to abstract the basic structure of the domain wall operators and its equivalence to Overlap in 
a higher level framework so that the Mobius class of generalized domain wall operator is only 
visible at the level of implementation much in the spirit of object oriented software practices. 
Consequently we will define precisely the mapping between the 5d domain wall action and the 
effective 4d Overlap operator at fixed L s , including the general way to formulate residual chiral 
breaking for correlators and the Ward-Takahashi identities. 

The Mobius generalization introduces two new parameters in the domain wall fermion 
action. One parameter controls the kernel of the resulting Overlap operator allowing for a 
continuous family of kernels that have the Neuberger |10| [TT] and the Shamir kernels [5] as 
special cases. The other is a scaling parameter a for the kernel that drops out of the 4D action 
in the L s — > oo limit, providing a better approximation to the GW relation at finite L s . In this 
sense it is proper to consider the scaling parameter a as offering a better algorithm for the 
same operator. For finite L s the result is that the Mobius generalization offers a very substantial 
algorithmic advantage at fixed residual mass, particularly at large L s . For example in current 
practice to get a sufficiently small residual mass for thermodynamics |29j and N = 1 SUSY [30] 
the Shamir algorithm with a fifth dimension as large as L s = 48 has been used. Here a rescaled 
Mobius gives to reasonably high accuracy an equivalent action by rescaling to L s = 12 — 16 
or a rescaling parameter a = 3 — 4. By reducing the extent of the 5 th dimension, the Dirac 
solver should be accelerated by roughly this factor a in these applications. Empirical results 
are presented here to support this estimate. In addition, it is even possible to further reduce 



L s at fixed m res , as noted in the conclusion (see Figs. 10 and 11), by combining the Mobius 
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algorithm with Gap method [19|. Consequently the Mobius algorithm can very substantially 
accelerate inverters in analysis and evolution code with speed up depending on the degree of 
chiral symmetry sought. 

Finally we recognize interest in combining algorithms based on the equivalence of 
overlap and domain wall formalisms, since each maybe optimal at different stages of a single 
Monte Carlo calculation. To accomplish this one needs to establish a precise equivalence between 
the domain wall and overlap formulations at finite lattice spacing and finite separation L s of the 
domain walls. Here we extend to the Mobius implementation the exact mapping between the 
domain wall and overlap Fermion correlators , 

(0[q,q]) DW = (0[^]) m . (1.2) 

We refer to this mapping as the DW/Overlap Correspondence. Of course for the Shamir 
action this mapping has been dealt with extensively in the literature |;7, 8. 9j [iTfllTl"l l2"5 l [26 1 127 1 [28] . 
Also we take the opportunity to develop a consistent notation for this correspondence, extending 
it to vector and axial currents and their Ward-Takahashi identities. By studying this for finite 
L s , we identify in both domain wall and overlap formulation the consequence of violating chiral 
invariance due to approximations (at finite L s ) that result in an imperfect realization of the 
Ginsparg- Wilson identity. The correspondence is based on the descent relations of Callan and 
Harvey [I] that motivated Kaplan's original idea. In the 5d space there are only vector currents 
since 4d parity becomes a reflection of the 5-th direction. The symmetric and anti-symmetric 
projections in the 5-th direction for the 5d vector current become the effective 4d vector and 
axial currents respectively of the effective overlap action. 

In this paper we separate the general analysis from the specific details that depend 
on the choice of the (domain wall, overlap) action and its approximation algorithms (using for 
example rational polynomial approximation, domain wall, etc.). In Sec.[2]we begin by a general 
statement of the choice of actions and the impact on an approximation to the Ginsparg- Wilson 
condition for chiral Fermions. Following this is the specific domain wall action for the Mobius 
Fermion generalization. Sec. [3] explains the necessity of a 4d Red Black preconditioning for the 
Mobius Fermion which is as effective as the 5d Red Black scheme used for the Shamir variant. 
In Sec. [4] we present the results from detailed numerical tests demonstrating the effectiveness 
of our formulation in reducing the cost of implementing domain wall Fermions. Sec. [5] presents 
the general mapping between a domain wall action and the overlap action that proves their 
equivalence at finite lattice spacing and given rational approximation. The map is applied to 
construction of the 4d vector and axial vector currents form the 5d vector domain wall current. 
In Sec. [6] we apply the equivalence formalism to establish the Ward-Takahashi identities for 
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vector and axial current operators as well as derive the axial and vector currents for Mobius 
Fermions. In addition, in this section we construct from the axial ward identity the residual 
mass as a measure of the violation of chiral symmetry due to finite L s approximation to the 
overlap operator and related it to the violation of the Ginsparg- Wilson relation. 



2 Mobius Domain Wall Fermions 



Domain wall and overlap Fermions may be viewed as two alternative algorithms for generating 
chiral Fermions that satisfy the Ginsparg- Wilson relation. This mapping between domain wall 
and overlap Fermions is even useful when chirality is approximated by domain wall Fermions with 
finite separation between the walls in the 5 th axis. Throughout this review, we shall emphasize 
this equivalence by identifying the effective 4d overlap operator, 

Dg°Xm) = + ^W[ 7 5 D kernel (Ms)] , (2.1) 

resulting from the corresponding domain wall implementation at finite extent L s for the 5 th 
dimension. Strictly speaking we should always designate this approximation to the overlap 
operator by Dov s \m) but to simplify the notation we leave this dependence on L s implicit 
throughout. The class of suitable overlap operators is quite large, dictated by the choice of the 4d 
kernel, D kernel (M5) , and a particular algorithm to approximate the sign function: €l[x] — e[x] = 
x/\x\. For example the standard domain wall implementation gives the polar approximation [61\ 
1261 [32] to the sign function (see Fig. [8]) , 

(1 + H 5 ) L --(1-H 5 ) L ° 
tL ' m ~ (l + W + Cl-^- ' ( } 

where the 5 th time "Hamiltonian" is H§ = -y^D kernel (M^,) with transfer matrix T = (1 — H^)/{1 + 
H§). There are many other ways to generate polynomial or rational polynomial approxima- 
tions [33j to the sign function, some of which have a natural representation as a local action 
in the h th time. The 5 th time interpretation of Neuberger and Kaplan provide a very intuitive 
method to impose chiral symmetry, as we will see when discussing the axial current. Only in the 
limit L s — > 00, does the effective domain wall operator reproduce exactly Narayanan-Neuberger's 
overlap Fermion [TJ El [9] . 

The exact lattice chiral symmetry (at m = and L s = 00) is guaranteed by the G-W 

relation, 

7sA™(0) + D ov (0) l5 = 2D OV (0) 15 D OV (0) . (2.3) 
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We may recast G-W relations as, 7s-D O u(0) + D ov (0)js = , by introducing a new 75 operator, 

75 = 75(1 - 2D OV (0)) = -e[H 5 ] , (2.4) 

which allows one to realize the infinitesimal chiral transformation on quark fields as Sip = 
^75, Sip = 75?^. Although the 75 depends non-locally on the gauge fields, since the G-W is 
equivalent to 75 = 1, it does allow an unambiguous definition of chiral projection operators: 

For finite L s the violation of chiral symmetry is given by the error, A£ s [U5], in the 
Ginsparg- Wilson relation: 

2 75 A Ls [# 5 ] = 75^0,(0) + D ov (0) l5 - 2^(0)75^(0) = ^ 75 (1 - e|[F 5 ]) . (2.5) 



From Eq. 2.4 this is equivalent to AL a [Hs] = (1 + 75) (1 — 7s)/4, which is a natural way to 
measure the failure to implement exactly the projectors: P± = (1 ± 75)/2. This GW chiral 
violation operator, Al s [Hs], is the unique measure of chiral symmetry violations applicable 
to any (not necessarily a domain wall) scheme to implement chirality. 

To emphasize this point further, consider how global chiral symmetry of the Fermion 

action, 

S ov = '^2'ip x D ov (m) X yipy = ip x D ov (0) xy ipy + rmp x (l - D m (0 ) ) ip x , (2.6) 

xy 

is modified both by the implicit violation due to an approximate overlap action and by the 
explicit breaking by the bare quark mass term, m. The mass term [j] is given in terms of the 
scalar density, 

S(x) = i) x {l-D ov (x))ip x = tp x + ^ 5l5 ip x , (2.7) 
paired with the pseudo-scalar, 

P(x) = to(l - D ov {x))^ x = V^^p 5 ^ , (2.8) 

under the chiral transformation: 5S(x) = 2P(x),5P(x) = 2S(x). At finite L s following the 
analysis in Ref. [26], we continue to define the chiral transformation using 75 = 75(1 — D ov (0)). 
The chiral transformation of the action in Eq. |2.6| now yields two contributions, 

5So V = ip[y 5 D ov (m) + D ov (m)j 5 ]ip = mip(>y 5 + %)ip + 2ip>y 5 A L ip , (2.9) 



x We note that the physical range for the mass is < m < 1. Indeed it is tempting therefore to rescale the 
action by (1 — m)~ 1 So V = 4> x D ov (0)xy 4> y + m,^^ defining additive bare quark mass m q — m/(l — m) since 
the limit m q — > 00 (or m = 1) corresponds to the decoupling of the Domain wall quarks by cancellation with 
Pauli-Villars operator Dpy = D ov (m — 1). However the conventional choice has the advantage of introducing 
the mass operator as the correct scalar partner S(x) to the pseudo-scalar density P{x). 
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the first one due to the quark mass and the second one due to the finite L s approximation, 



expressed as same GW violation operator, A x r y , defined above in Eq. 2.5 Like the overlap 



operator itself, this chiral violating operator, A L , should also fall off exponentially in units of 
the lattice spacing so that it may be approximated by an effective Lagrangian, 



V>A L ?/> rrires^ + ci^7m(^a« ~ + i ^a^F^ij) + c^O^ - iA^-ty + • • • , (2.10) 

expanding in the lattice spacing. The terms in increasing dimension define a residual mass, m res , 
the 4d wave function renormalization, the 5d operators or the clover term and so on respectively. 



In Sec. 5.1, we will extend this analysis to examine the local breaking of chiral symmetry in 
the Ward-Takahashi identities for the axial current. There we show that this same operator, 
■075 A^ip, is the correction to the divergence for axial current. Its vacuum to pion matrix element 
is the conventional definition of the residual mass [6A\ [55"1 [55] . 

In spite of the exact mapping between overlap and domain wall Fermions, unfortu- 
nately the most common implementations use different kernels. For the overlap it is natural to 
use the Wilson (or Neuberger /Borici) kernel, 

D Borici (M 5 ) = a 5 D Wilson (M 5 ) , (2.11) 

while for the Shamir domain wall implementation [5], the kernel is 

° (Ms) " 2 + a 5 Z?™"-(M 5 ) ' (2 - 12) 

where we include in the definition the lattice spacing as in the fifth direction, although it is 
generally set to 05 = 1 in units of the space-time lattice. Both the Shamir and Neuberger/Borici 
kernels are constructed from the Wilson lattice Dirac operator, 

DZ Uson [U^x), Ms] = (4 + M 5 )S x , y - i [(1 - 7m )£V(*)<W, + (1 + 7^I(i/)<Wm] > ( 2 - 13 ) 



with a negative mass term M5 = 0{— 1). Unfortunately the Shamir kernel (2.12) is computa- 



tionally expensive to use in overlap codes. On the other hand, as first realized by Borigi 



the simpler Wilson overlap kernel (2.11 ) can easily be realized as a domain wall action by adding 
a specific next to nearest neighbor term in the 5 th direction. 

Here we found that an equally computationally efficient domain wall action allows one 
to introduce the generalized kernel, 

n M oeUus (M , (b 5 +C 5 )D W ^(M 5 ) 

" 2 + (65 - c B )DW*™(M s ) ' { ' 
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Figure 1: The error, 1 — e^ s (A) for the polar decomposition at L s = 16 is plotted against an 
eigenvalue of the kernel H§ for A £ [0.1,10]. In the polar approximation the error is positive 
semi-definite for A > 0, anti-symmetric for A —> —A and symmetric under A —> 1/A. This is 
compared with |1 — ei s (X)\ for the Zolotarev polynomial at L s = 10 whose error fluctuates in 
sign. 



referred to as Mobius Fermions because the 3 parameters M5 , 65 , C5 are equivalent to a real 
Mobius transformation, D Wilson (-l) [a + bD Wilson (-l)]/[c + dD Wilson {-!)}, of the Wilson 
operator - the most general conformal map preserving the real axis. It encompasses both the 
Shamir and Borigi form as special cases. Obviously the standard polar decomposition for the 
Shamir kernel is recovered with 65 = as and C5 = while the Borigi truncated overlap action 
is implemented for 65 = C5 = 05. In addition to the Shamir parameter, 05 = 65 — C5, there is a 
new scale factor, a = (b^ + c^/a^, which turns out to have a major impact on reducing chiral 
symmetry violation at finite L s . Just rescaling the Shamir kernel we have 

n J~)Wilson ( tut \ 

D MoeMusr M , = "5 I 5) a D Shamir (M 5 ) . (2.15) 

v ; 2 + a 5 D Wllson (M 5 ) y J y ' 

We will also extend our analysis to s-dependent coefficients (a(s) = bs(s) + c^(s),a^(s) = 
bs(s) — C5(s) ) so that this presentation includes the Zolotarev approximation (for example) 
as described in Ref |37| 138] . See Fig. [I] for an illustration of the error in the sign function for 
the polar versus the Zolotarev approximation. 

The scaling parameter plays a particularly important role in Mobius Fermions. Since 
the sign function is scale invariant, if we simply rescale the Shamir action, tLs\H*>\ = ti JS / a [aH^\, 
the Mobius action is identical to the Shamir action in the L s — > oo limit. Thus it is proper to 
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Figure 2: The top curve represents the sign function £L s (aA) plotted against A for Shamir 
(a = 1) at L s = 48 compared with scaled Mobius (a = 4) at L s = 12, which are indistinguishable 
on a loglog plot. This contrast with the middle and lower curves for Shamir at L s = 32 and 
L s = 16 respectively which do degrade the chiral approximation substantial. 



regard this as a new algorithm with improved chirality at finite L s . A heuristic explanation for 



this is easily understood. Consider how the polar approximation in Eq. 2.2 



e Ls (H 5 ) = Y,\ X )tL s W(M, (2-16) 

A 

in the spectral representation approaches the sign function e(iJs). The polar approximation, 
illustrated in Fig. [TJ is exact at A = ±1 and has an error A Ls (A) = (1 - e| (A))/4 that satisfies 
the inversion symmetry A^ s (A) = Al s (1/\). It converges exponentially for increasing L s in a 
window 0(1/ L s ) < |A| < 0(L S ). However since the spectrum of H§ is bounded |A| < \ ma x 
the upper end of the window is wasted whenever L s > 0(X max ). Consequently by using the 
Mobius scaling parameter a, the log spectrum can be shifted and for a\ ma x < 0(L S ) we take 
advantage of the entire window in the polar approximation, greatly improving the approximation 
for the small eigenvalues with essentially no damage to the approximation at the high end of the 
eigenvalue spectrum. For example with the Shamir kernel with as = 1, X m ax — (8 — Ms)/(10 — 
M5) ~ 3/4 so we can rescaling the Shamir kernel by a = 0(L S ) allowing the window to cover 
better the low eigenvalues that are mostly responsible for the observed explicit chiral symmetry 
breaking. 

In addition note that for low eigenvalues, the sign function obeys the scaling rule, 
e L s (A) ~ ^L s /a( a ^)- Indeed as illustrated in Fig. I2J this scaling rule holds quite well in the 
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the entire spectral range of the Hermitian kernel H§ so the Domain Wall operator is essentially 
unchanged under this "approximate equivalence relations" : 

Scaling Rule: Shamir at L s ~ Mobius at L s /a . (2-17) 

Since each eigenvector of the operator and therefore the operator itself is preserved, the Mobius 
rescaling can be understood as an improved algorithm for essentially the same lattice Dirac 
action. For codes requiring an aggressive chiral approximation, the advantage of the Mobius 
algorithm will be dramatic. For example, recently the Shamir domain simulation have required 
L s = 48 for J\f = 1 SUSY [30] and L s = 32 for thermodynamics [29J or even larger L s to 
get sufficiently small m res . However as demonstrated in Fig. [2] the rescaling from Shamir at 
L s = 48 to Mobius at L s = 12 and a = 4 has no visible effect on a loglog plot. The difference is 
| £48 [A] — ei2 [4A] | < 10~ 3 for all the vectors of H^. Consequently any simulation using L s = 32 or 
larger should benefit very substantially by a factor of 2-4 using Mobius in the range of L s = 12 
to 16 with equally small values of m res . 



2.1 Domain Wall Action for Mobius Fermions 

We now proceed to construct the domain wall action that gives rise to the Mobius kernel. It is 
useful to introduce two generalized kernels: 

£>W = b 5 (s)D Wilson (M 5 ) + 1 and L> W = c 5 (s)D Wilson (M 5 ) - 1 , (2.18) 

with s = 1, 2, • • • L s . We keep an arbitrary s-dependence to allow for generalization of our formu- 
lation to other five dimensional approximations to the overlap such as the Zolotarev polynomial 

(s) (s) 

approximation. Also the s-dependence of D + and D_ allows for generalizations to formula- 
tions that require a gauge field dependence of the fifth dimension such as the domain wall filters 
suggested in Ref. 139 . Now our generalized domain wall Fermion action^] is 

s=l s=2 s=l 

- m^iD i l ) P + ^ La - m^ La D ( L s) P^ 1 , (2.19) 

where we follow the conventions of Edwards and Heller [25], placing the left chiral modes, 
q L = P_\E'i on the s = 1 wall and the right chiral modes, q R = P+^Ls on the s = L s wall as 

2 Through out we adopt the convention of upper case for 5d domain wall fields (e.g. ^,3) and lower case for 
4d overlap fields (e.g. ip x ) and 4d domain wall fields on the boundary (e.g. q = [P 
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depicted in Fig[3j P± = |(1 ± 75) are the chiral projectors. Written as an L s x L s tridiagonal 
matrix the Mobius operator is 

1(1) 



D 



DW, 



m) 



-mD {La) P_ 



D W P _ 
Df 
D®P+ 



D 





(3) 



D (2)p_ 












D 



(Ls) 



(2.20) 



For the polar decomposition, it is sufficient to choose constant coefficients 65(5) = 65, cs(s) = C5 
for all s with the kernel taking the form, 

D Moebius (M s = (h+06)D WU9On (M S ) = D+ + D- 

{ 5) 2 + (65 - c 5 )D Wilson (M 5 ) D+-D-' K ' ' 

It is worth noting that there maybe a computational advantage to placing the chiral projectors, 
P±, to the left of Converting to this form is a trivial similarity transformation, D DW (m) — > 
D DW (m) = DZ 1 D DW (m)D- where we introduce the matrix, 



D. 



Diag 



D (i) D W D W 



D 



to 



(Ls) 



diagonal in the 5 th axis. Assuming reflection symmetry D 
generalization of 75-Hermiticity is 

T 5 D DW (m)T 5 = D^ DW (m) . 

Here Ts = ^IZDZ 1 is defined to include inverting the 5-th axis, 



to) 



D (L.+l-s) the 



(2.22) 



(2.23) 



and rescaling by DZ 1 . The fact that H DW = ^KDZ 1 D DW (m) is a Hermitian operator guar- 
antees that the effective 4d overlap operator is "75 Hermitian" [^j 



2.2 Domain Wall to Overlap Reduction 

The equivalence between domain wall and overlap Fermions is constructed by dimensional re- 
duction from 5 to 4 dimensions. Since the Fermion action is quadratic, there is an explicit map 

3 In terms of the transfer operator on the (s + 1, s) link introduced later this condition is T s = Tl s +i- s - In 
the case of the Zolotarev approximation, one uses the condition that s-dependent transfer matrix commutator 
vanishes ([T s ,T s i] = 0) to show 75 Hermiticity of the 4d effective operator. 
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s — > 



Figure 3: Domain wall convention with the physics Right/Left chiral mode at s = 1 and s = L s 
respectively for an approximation of the fifth dimension by width a^Lg. The Pauli-Villars oper- 
ator has anti-periodic boundary condition and the Dirac operator Dirichlet boundary condition 
at zero quark mass. 



to the 4d overlap operator from the 5d domain wall action. To accomplish this is a straight for- 
ward application of LDU decomposition for both the Dirac, (D DW (m)V) and Pauli-Villars 
(D DW (1)V) matrices, as demonstrated in Appendix [X} This procedure makes use of a pivot 
matrix, 



V 



P 




P+ 



This matrix, V s 



5 SS >P- + 5 S > 



s',(s+l) modL s r + 











p+ 

p 



(2.24) 



is a unitary permutation, V~ l = = 1ZVTZ, 
which rotates the positive chiral mode at s = L s to the same position as the left chiral mode at 
s = 1, so that the new 4d Dirac field is reassembled at s = 1: 



P-* M +P+* <Bl .L. = [P t *kl. 



(2.25) 



The main result of the LDU decomposition in Appendix[A]is that we are led to consider 
the domain wall Dirac operator preconditioned by the Pauli-Villars (corresponding to infinite 
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quark mass) operator, 



K DW (m) 



<p\. 



1 



D DW {m)V . 



D DW {1) 

This matrix has a remarkably simple form given by 



(2.26) 



K DW {m) = V^ 



D DW (m)V 



D DW {1] 



D ov (m) 

-(l-m)Af 1 

-(l-m)Af 

-(l-m)A? 





1 







1 



-(l-m)Af 



•• 

•• 

•• 

•• 

1 



(2.27) 



Its inverse, or propagator matrix, has the form 



\DW 



( m ) = V ] 



D DW (1)V 



D DW { 



m 



D-}(m) 

(l-m)AfD- v 1 (m) 1 

(l-m)AfD- 1 (m) 

(l-m)AfD- v 1 (m) 





1 






1 



(l-m)AfD- 1 (m) 



•• 

•• 

•• 

•• 

1 



(2.28) 

The results for the K DW matrix in Eq. 2.27 and its inverse in Eq. 2.28 play a fundamental 
role in understanding the dynamics of the domain wall Fermion construction. They state the 
remarkable fact, that once the domain wall Fermion operator, D DW (m), is "preconditioned" by 
the Pauli-Villars operator, D DW (1), the propagation into the fifth dimension from the chiral 
domain wall is "instantaneous" . All the essential properties are independent of the particular 
structure of the domain wall operator. The "instantaneous" property is easily demonstrated by 
rewriting the propagator as the sum of two terms, 

1 „nw,*\~~ -. „+ 1 



pt 



D DW, 



m 



-D DW (l)V = l + V ] 



D DW (m) 



(D DW {1) -D uw {m))V . 



DW i 



(2.29) 



The first term contributes exclusively to the non-zero value on the diagonal and the second to 
the first column, 



\{D DW {!) - D uw [m))V] 



DW i 



s ,s 



-(1 



(2.30) 



because the mass parameter in the domain wall operator is restricted to the link connecting the 
left and right walls. The zeros in the above equation represent the cancellation between bulk 
modes of the Dirac and Pauli-Villars terms. In the continuum the cancellation occurs [ID] for 
all off-diagonal terms but otherwise is very similar. 
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The non-zero elements depend only on the transfer matrix along the 5 th dimension, 



Ts = TTW s Wlth Hs = l5 2 + (b 5 (s)-c 5 (s))DWUson {M5 y (2-31, 
The effective overlap operator is then given by 



. l + m 1-m Y L °-l 



(2.32) 



where T is = T 2 • • -T^ . The rest of the first column are built out of partial ordered 
products 

T^T^-'-Tr 1 



A 



-lrp-l 
l 2 

1 + T- 



/T"i — 1 rp — 1 rp — 1 

1 s+l 1 s+2 " ' 1 L S 



1 + T- 



(2.33) 



We also note that the GW violation operator, A^ s , in Eq. 2.5 has a similar expression joining 
the left and right wall, 



A Ls = A* Af +1 



1 



J-Ls/2 + jL 3 /2 



1 - e| (H* 



(2.34) 



Clearly convergence to the exact chiral limit, Al s ^ oq = 0, depends on the spectrum of the 
transfer operator as is evident in the original construction of the overlap operator by Neuberger 
et al 0181 [TO]. 



2.3 Overlap to bulk Domain Wall reconstruction 

We can essentially reverse the 5d to 4d dimensional reduction at a small cost using the more 
easily inverted Pauli-Villars operator. This is useful in a variety of instances, the most obvious 
being the reconstruction of the full domain wall solution, when you have only saved the solution 
to the effective overlap propagator. Another recent example is the use of a Mobius operator at 
small L s as a preconditioner for larger L s standard Shamir operator, as in the recent Mobius 
Accelerated Domain Wall Fermion (MADWF) algorithm [20J. 

Let's begin by assuming we have the solution to the effective overlap equation, 

D ov (m)iP = b, (2.35) 

and proceed to find the domain wall propagator assuming that its source is given by B s = b 5 s ^i 
on the boundary. The goal is to reconstruct the 5d bulk propagator. In the first step using 
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Eq. 



2.27 



and defining, ip a = tp 6 Sy i, we reconstruct the vector in the interior, 



n = v 



D DW (1 



D 



DW , 



m)Vip 



"(1 
"(1 

fl- 



D ov tp 
m^A^ip 
m)A^ip 



(2.36) 



with the cost of having to invert the Pauli-Villars matrix. Using Eq. 2.28 we see that the 5d 
solution is given by 



D DW (1)VB 



1> 

rnjAl^'ip 
m)A§ip 



D DW {m) 

[1 - m)A^ 

where B s = b 5 St \ is a 5d vector with support on the boundary only. 
Comparing the two results we have 



(2.37) 



Wf°f = diag(£> 



l 

cm i 



-1 ... 



-i)n 



-n. 



s = 1 



(2.38) 



This final step only requires flipping the sign, ty s = —£l s for s = 2, • • • ,L S . The cost of this 
construction is the cost of solving a 5d linear system for the Pauli-Villars matrix which converges 
in O(50) conjugate gradient iterations for typical applications. 

Using this formulation we can construct the conserved axial- vector and vector current 
matrix elements using only the stored overlap propagators. Furthermore, we can use this for- 
malism to define prolongation and restriction operations to coarsen the fifth dimension in the 
spirit of a multigrid solver. However it should be realized that a true domain wall multigrid 
algorithm [24] requires blocking in the 4d to overcome the critical slowing down in the chiral 
limit. 



2.4 4d Hybrid Monte Carlo 



Another related consequence of the fundamental identity in Eq. |2.28 is the ability to reformulate 



Hybrid Monte Carlo evolution for domain wall Fermions restricting the pseudo-fermions to the 
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wall. For example consider the standard approach to HMC in using domain wall Fermions in 
dynamical Fermion calculations |4H W2\ 



det[Dl v D ov \ = j , 



n ■ ■■d(p Ls e 



(2.39) 



for 2 flavor example that introduces pseudo-fermions <f> a in the bulk. If we look at the action 



DL u ov 



£ [<t>* + (1 - m)Af^-> 1 ] t [4> s + (1 - m)Af D-Vi] , 



(2.40) 



s=2 



we see that only the first term contributes to the path integral. The remaining terms integrate 
to unity trivially but they do add unwanted noise to the stochastic estimator. 

From the reconstruction procedure described above the bulk degrees of freedom should 
be redundant. Indeed this is true if we use the identity 



(DlWDovim))- 1 = [A DW A DW % = [7>t 



' D DW {l)rf DW (1) ^JL^ Plii 



D DW (m) 



2.4F 



that follows from Eq. 2.28 to rewrite the determinant in terms of pseudoFermions restricted to 
s = 1. 

det^L^o,] = J d^e-^ Vl ^^) DDw{1)D ^ {1) ^ . (2.42) 

The benefit of removing the redundant random pseudo-fermions from the bulk may be significant. 
Unfortunately, we should also note that the force term now requires two inversions instead of 
one as seen from the expression, 

^ = 5^[4>^D-J w {m)D DW {l)D^ DW {l)D^{m)V^ 

= X ] K [D DW {m)] iP + [D ] DW {m)] X + X ] K [d dw (1)D^ w (1)\ X , (2.43) 

in terms of auxiliary x an d ip fields defined as, 

i> = D- l w {m)D DW {l)rf DW {l) X . (2.44) 

The extra cost of the second inversion might be acceptable due to better scaling for large L s 
or be compensated by better inversion algorithms for the first order system. Exploring these 
possibilities is beyond the scope of this paper. 
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Finally, using the formalism described above we can formulate a preconditioned HMC 
using a small L s domain wall operator as a preconditioner. Then using a multiple time scale 
integrator, one can run most of the calculation using the small L s operator. The correction step 
which will require the inversion of the large L s matrix will only be needed a few times during 
a trajectory. The precise implementation as well as the possible interplay with the Hasenbush 
preconditioner requires detailed experimentation for finding the optimal HMC algorithm for 
DWF dynamical calculations. Here we simply suggest one more trick to be used together with 
the rest of today's HMC methodology. 



3 Red Black Preconditioning 

In order to place the performance of Mobius on an equal footing with Shamir Fermions, it is 
essential to implement red-black preconditioning. For the standard Shamir implementations, 
this step alone accounts for nearly a factor of 3 speed up. However the standard 5d even-odd 
preconditioning is not efficient for the Mobius generalization because the new Wilson operator 
D_ connects 5d even and odd sites (x ± /x and s ± 1). This preconditioning would require a 
Wilson operator inverse as a new inner loop. This problem also occurs for the Borici domain 
wall action and continued fraction approach to the Overlap operator and in general whenever 
the hopping terms for the effective "fifth" dimension is coupled to the spatial lattice. 



To avoid this problem we define a new red-black partitioning that uses a 4d red-black 
lattice without alternating color as you run along the 5-th axis. The result is a red-red and black- 
black matrix that is tridiagonal with constant coefficients. The specific construction is as follows. 
Red-black is chosen as checkerboard on the space-time lattice as x\ + X2 + x% + X4 = even/odd 
for all s £ [0, L s ]. The hopping matrix in 5d are D® r w and D® h W so 



D 



Wilson 



(M) 



Irr -^rb 



DW 
■b 

bb 



with Schur decomposition, 



1 

jjDWt-1 1 
u br 1 tt 1 



I»-D™I?D™ 



D DW {M) 

leads to the red-black Schur complement (or preconditioned matrix): 



1 f-inW 
1 i rr . u rb 

1 



n DW _ 1 _ T -l n DW T -l n DW 
-LJpre — 1 1 bb U br 1 rr u rb 



(3.1) 



(3.2) 



(3.3) 
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With M + = 6 5 (4 + M 5 ) + 1 and M_ = c 5 (4 + M 5 ) - 1, the diagonal blocks in 5d are 



hb 



M + M-P- 
M_P+ M+ M_P_ 
M-P+ M+ 



-mM-P- 
These can easily be inverted with the observation that I r 



—mP+ 



M+ 

A rr P- + AT r P+ where 



(3.4) 



M 4 



1 




-mM_/M + 



M_/M+ 
1 M_/M+ 
1 











(3.5) 



To find the inverse I^ 1 = A~^P- + Aj~ 1 P + , we solve .A rr \I r £ = 6^ and ^4^^^ = 6r inde- 
pendently for each chiral sector. This is performed by Gauss elimination of 0(L S ) steps of 
multiply-add and one division for half the lattice points per site in the space-time volume. 
There is no communication if the s-axis is inside the processing element. The computation cost 
is negligible in comparison with single Dirac applications. 

The closed form solution of A rr ^ = b with a = M_/M + and M = mM_/M + is first 
found for s = L s by forward eliminations, 



Ls-l 



M + (1 - M (-a) 1 -- 1 ) 
and the rest for s = 1, L s — 1 by back substitution 



r \b L + m (-«r %} , 



s=l 



Mi 



(3.6) 



(3.7) 



The same procedure interchanging rows and columns solves, A^ r ^/ = b. However we emphasize 
that in the code it is best to do Gaussian elimination on the fly rather than store the matrix 
inverse. 

Our new (4d) red-black preconditioning for Shamir Fermions is performing as well 
as the original 5d red-black preconditioning in terms of iteration count. For Mobius Fermions 
the speed up due to our variant of red-black preconditioning is about a factor 2.5 for our 
present lattice set. In fact the new version which treats all 5-th axis sites uniformly at fixed 
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Number of Dirac applications 

Figure 4: L s dependence of the residual mass versus the number of Wilson Dirac applications 
required to invert the standard Shamir operator with M5 = 1.8 and 05 = 1 at m = 0.06. 



x is advantageous in fast assembly level code for vector pipelines such as those found in Intel 
processors. Combining this with better data layout using Morton data layout, the Shamir version 
of the Mobius Domain Wall Fermion (MDWF) code of Pochinsky [21] has achieved a factor 2 
improvement relative to earlier implementations. 



4 Numerical Results 



We have made a series of tests of the effectiveness of the Mobius operator relative to the standard 



Shamir formulation. For the Shamir formulation (2.12) where 05 is fixed to 1, one tunes the 



Wilson mass M5 and the extent of the fifth dimension L s in order to achieve the desired degree of 
approximation to the exactly chiral Dirac operator. While the violation of chirality as measured 
by m res appears to decrease exponentially at first for increasing L s , it quickly slows down 
asymptotically to an 0(1/ L s ) power fall off [13] as illustrated in Fig. |4j 



In the Mobius kernel (2.14) in addition to M5 and L s , there are two more parameters, 
the lattice spacing in 5 th dimension, 05 = 65 — C5, and the scaling parameter, a = 65 + C5. The 
parameter 05 represents a modification of H§ and allows us to interpolate between the Borici 
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Number of Dirac applications 

Figure 5: The residual mass for the Mobius algorithm as a function of L s and a at fixed 
M5 = 1.5, 05 = 1 and m = 0.06 in comparison with the standard Shamir algorithm of Fig. [4j 



kernel and the Shamir kernel. However the essential parameter in our scheme is the scaling 
parameter, a, which allows us to optimize the window (see Fig. [I] ) for the sign function (|2.2l) 
at fixed L s . Before describing the details of our numerical investigation, we present in Fig. [5] a 
summary of the result of minimizing the chiral violation as measured by m res at fixed L s . This 
clearly shows the importance of choosing an optimal value of a(L s ) as a function of L s . It is 
remarkable that the residual mass now appears to continue to fall exponentially with L s , if we 



allow this re-tuning of a as L s increases. However in Sec. 6.5 we argue that a tuned Mobius 
algorithm eventually falls asymptotically as 0(1/ L^), in contrast to the standard Shamir action, 
which shows a much earlier cross over from exponential to 0(1/L S ) power behavior at large L s . 

For very small m res the Mobius formulation has the potential for orders of magnitude 
reduction of the explicit chiral symmetry breaking at fixed computational cost. For example in 
our test case, residual masses 6 x 10~ 4 and 3 x 10~ 5 were achieved at L s = 24 and L s = 32 
respectively. Achieving the same residual masses with the Shamir formulation would apparently 
require a prohibitively large domain wall separations of order L s = 10 2 and L s = 10 3 respectively. 
This observation is consistent with the estimate of an 0(1/ L s ) vs 0(1/ L^) asymptotic fall off 
of m res for the standard Shamir algorithm vs the improved Mobius algorithm. 
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4.1 Testing procedure 



We have run a large number of tests but for simplicity we present here an optimization study 
based only on a small sample of 20 quenched /3 = 6.0 Wilson gauge action configurations available 
from the Gauge Connection archive. In spite of this small test sample, we believe the lessons we 
have learned here are general based on our understanding of how the Mobius formulation works. 

Before we proceed in presenting the details of our testing procedure, we note that one 
Mobius operator application does not need more Wilson Dirac operator applications than the 
standard Shamir operation at the same L s . At first glance one might think that this is not the 



case due to the occurrence of the additional off diagonal terms (2.20) to s =t 1 with the Wilson 
Dirac operator. However by first taking a vector gather of the neighboring Dirac spinors at 
s + 1,3, s — 1 no additional Dirac applications are required. The additional vector operation 
on the spinor is insignificant and can be ignored in the total cost estimates. In addition, in 
order to achieve convergence of the conjugate gradient solver for the Mobius operator similar 
that achieved for the Shamir operator, the new 4d Red/Black preconditioning described in Sec. [3] 
had to be invented. This preconditioning also adds an insignificant amount of extra computation. 
In all the graphs, the label number of Dirac applications represents the product of L s times the 
number of iterations per source to fixed precision. 

In comparing the residual chiral symmetry breaking and the cost of the quark prop- 
agator calculation between different values of the parameters, we have at our disposal, we are 
faced with the fact that the quark mass renormalization factor changes as one changes the 
kernel operator H§. Hence estimating the cost of the calculation at constant physics, and the 
physical residual chiral symmetry breaking is tricky. In order to resolve this issue, we adopted 
the following scheme in tuning our bare quark masses. First, as a point of reference for all our 
tests we adopted the Shamir operator with M5 = 1.8 (with 05 = 1.0 and a = 1.0 in Mobius 
parameterization). The RBC collaboration found in |344 [37)] that this M5 is optimal for the 
(3 = 6.0 Wilson gauge action quenched ensemble. This was followed by a variation of the quark 
mass on the Mobius side, until the pion mass agreed with that obtained using the standard 
Shamir action. The renormalization factors, Z m , is the ratio of the Shamir bare quark mass to 
the Mobius bare quark mass needed to obtain equal residual masses. 

From the data in Table [TJ we can deduce that the quark mass renormalization factor 
Z m is mostly sensitive to the parameter 05 = 65 — c$. All other parameters induce very small 
variations on this renormalization factor. For that reason, in comparing residual masses between 
different sets of parameters, only the non-trivial dependence of Z m on 05 has to be taken into 
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account. In all our tests the Shamir quark mass we used is m = 0.06 which gave us a pion mass 
of roughly, m n = 0.44 in lattice units. 



m 05 


m M 5 


m L s 


0.130 2.00 
0.109 1.75 
0.091 1.50 
0.075 1.25 
0.065 1.00 
0.055 0.75 
0.040 0.00 


0.093 1.3 
0.092 1.4 
0.091 1.5 
0.090 1.6 


0.091 12 
0.091 16 
0.091 24 
0.091 32 



Table 1: The quark mass, m, in the Mobius algorithm at fixed m n = 0.44. We vary 05, M5 and 
L s , one at a time, away from the base case: (05, M5, L s ) = (1.5,1.5,16). The dependence on 
a = 65 + C5 is not visible to the accuracy quoted above. 



4.2 Dependence on a 5 and M 5 at fixed L s 

An exhaustive exploration of the Mobius parameter space M5, 05, a for minimizing m res at fixed 
L s and is a laborious task. Fortunately it is easy to develop heuristics that simplify the 
search. Here we describe two limited searches at L s = 8 to illustrate our procedure. These 
explore the optimization of a for a range of values of 05 at fixed M5 = 1.5 and subsequently we 
performed an optimization of M5 fixing 05 to its optimal value 05 = 1.5. 

In FigJHJwe present the residual mass for the Mobius operator at L s = 8 and M5 = 1.5, 
varying 05. The series of points for a given 05 correspond to different values of a. In general, 
the number of Dirac applications grows with increasing a. But for as = 1.75, it can be seen 
that this behavior changes and it is even reversed at 05 = 2.0. From this graph we conclude 
that 05 = 1.5 is the optimal point. It gives roughly the same residual mass as the Shamir at 
L s = 16 with roughly half the Dirac operator applications. To be precise in Fig(6]the values of 
a are increased by 0.1 moving left to right (for increasing Dirac applications) in the intervals: 
a G [1.0-1.3] for a 5 = 0, a G [2.3-2.6] for a 5 = 0.75, a G [2.2-2.5] for a 5 = 1.0, a G [2.1-2.4] 
for a 5 = 1.25 and a G [1.9-2.2] for a 5 = 1.5. For a 5 = 2.0 with the values a = [1.6, 1.9,2.0], the 
points reverse direction moving right to left for decreasing Dirac application and for 05 = 1.75 
in the cross over region with a = [1.7 — 2.1], the first three points move down to the left and 
last point moves back to the right. 
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Number of Dirac applications 

Figure 6: The Mobius algorithm as a function of 05 and a at fixed m = 0.06, [7]M5 = 1.5 and 
L s = 8 in comparison with the standard Shamir algorithm of Fig. |4j The series of points for a 
given as correspond to different values of a enumerated in the text. 



Next we fix 05 = 1.5 to its optimal value and explore the M5 dependence of the 
residual mass for Mobius action shown in Fig. [7j Here as we increase a in all cases the number 
of Dirac applications increase. In Fig. @ moving left to right (for increasing Dirac applications) 
with neighboring point separated by 0.1 we plotted a £ [1.6 — 1.9] for M5 = 1.6, a £ [1.9 — 2.2] 
for M 5 = 1.5, a G [2.3 - 2.6] for M 5 = 1.4 and a € [2.6 - 3.2] for M 5 = 1.3. It can be seen 
that M5 = 1.5 and M5 = 1.4 show equally good performance. M5 = 1.4 achieves a little lower 
residual mass with a corresponding modest increase in the number of Dirac applications. In Fig. [5] 
summarizing our results for the optimal tuning, we choose M5 = 1.5. Our overall conclusion is 
that the scaling parameter a is the most important for optimization. This trend is explained in 



more detail in our discussion of the residual mass in Sec. 6.5 below. We emphasize that if you 



choose to only minimize the residual mass by increasing a in accordance with the scaling rule 



in Eq. |2.17| adjusting aL s < X m ax for a fixed Mobius kernel, the spectral decomposition of the 
effective 4d operator is barely modified. So this procedure can be viewed as essentially as an 
improved algorithm for the same Dirac action, converging more rapidly to the same 4d overlap 
action in the L s — > 00 limit. 

Because we performed all our tests at fixed pion mass, one might ask how this opti- 
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Figure 7: The Mobius algorithm as a function of M5 and a at fixed m = 0.06, as = 1.5 and 
L s = 8 in comparison with the standard Shamir algorithm of Fig. |4j The series of points for a 
given M5 correspond to different values of a enumerated in the text. 

mization depends on the bare quark mass. For that reason we repeated our optimization of a at 
bare quark mass m = 0.02 and found that the choices we made in our original test at m = 0.06 
are still optimal. 

4.3 Zolotarev Polynomials for the quenched fields 

Finally, we should point out that the Mobius formalism allows for the use of the Zolotarev 
approximation to the sign function |37[ [5H] even for 05 7^ 0. One has to introduce an a that 
is dependent on the fifth dimension. For each 4d slice in the fifth dimension a s has to be set 
to the inverse of a root of the Zolotarev polynomial, i.e. for every s G L s , a s = b s + c s is 
equal to a Zolotarev coefficient, where b s — c s = 05. We define an a (without subscript), as 
an overall factor, to act as a * a s . In Fig(8]we study the numerical behavior of the Zolotarev 
approximation, for a setting as shown in FigjTJ i.e. with L s = 10 for Zolotarev and L s = 16 
for the polar decomposition, both positioned logarithmically symmetric around one (i.e. per 
definition with a = 1). Fig(8] shows results for as = 1.0 and 05 = 1.5. For a given a the 
residual masses of the two cases agree, as expected from the polynomials in FigJT] The crucial 
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M 5 =1.5, a 5 =1.0, a=1.0, 2.6 
M 5 =1 .5, a 5 =1 .5, a=1 .0, 2.0, 2.5, 3.0 
M 5 =1.5, Zolotarev, a 5 =1.0, a=1.0, 2.0 
M 5 =1.5, Zolotarev, a 5 =1.5, a=1.0, 2.0 
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Figure 8: Comparison of the Zolotarev and polar decomposition approximation to the sign 
function for varying a and two different 05. 



observation is that for both 05, the Zolotarev approximation, even though it employs a smaller 
L s , requires a larger number of Wilson Dirac applications. This number increases dramatically 
with q. Hence, even though the Zolotarev approximation is optimal in the sense that it achieves 
a given residual mass with the smallest possible L s , in our test it is not optimal numerically, 
since the iterative solver for the quark propagator converges at a slower rate. So far we have 
not found any way to make Zolotarev competitive to the Mobius in the polar form. 

Furthermore, the reflection symmetry and the positivity of the residual mass is lost 
by using the standard Zolotarev approximation. One can restore these properties by using 
polynomials with double roots and arranging them in pairs at locations s and L s — s — 1 in the 
fifth dimension. But the loss of positivity means that m res is no longer be a good measure of 
chiral symmetry violations. As noted in the introduction a trivial shift in the bare quark mass 
by rrires only cancels the first term in the effective chiral Lagrangian. The higher order terms, 
such as the clover term, may or may not be improved by the Zolotarev polynomial. However, 
given that the Zolotarev approximation to the sign function has an error A(x) = (1 — e 2 (A))/4 
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that is uniform across the approximation window (A m i n , A mox ),if A(A) becomes 0(e) for all 
A € (X m in, A max ), then all chiral properties should be also improved to that order. 



5 Domain Wall/Overlap Operator Correspondence 

To complete the dictionary between overlap and its corresponding domain implementation, one 
must show that all Fermionic correlators in a fixed gauge background, U^x), are equivalent. 
Namely that 

(0[q,q]) DW = (0[^,^]) ov , (5.1) 

where here the domain wall correlators are restricted to Dirac fields on the boundaries, (see 
Fig.§, 

q x = [V^] x ,s=i , q x = [VD DW (l)V] x , s=1 . (5.2) 
The Fermionic path integrals (• • •) dw and (• • -) ov are given by 

(0[<M>™ = / #A e'^^y^y o[^,^\ , (5.3) 

and 

(0[q,q}) DW = J dV(md$d$ e -^ D ™s( m )*^' -^D^ ys ,(l)% jS/ ^ (5 4) 

for overlap and domain wall Fermions respectively. In the latter ^f x ,s, ^x,s are 5d Dirac fields 
and & x ,si &x,s are 5d bosonic Pauli-Villars fields. 

Choosing the unit operator 0[q, q] = 1 the lowest order identity is 

Z m [U] = Z DW ]U] , (5.5) 
choosing the irrelevant field independent overall normalization constant appropriately. This iden- 



tity is a simple consequence of our "master" equation (5.12) for K DW = D D \ v (m)D(l)D\y'P ■ 
It is also instructive to see this directly as a field redefinition, = 7?t$ , = $D DW (1)V, 
in the domain wall partition function. The result is that the Pauli-Villars action is trivial: 
$D W (1)$ = $ To cancel the Jacobian an identical redefinition is needed for the Dirac 
field, Q = , Q = ^D DW {l)V. Thus one can rewrite the domain wall partition function, 

Z DW [U] = jd^e-Q^ DDW ( i r 1 D DW (rn)VQ = det{K DW h (56) 

which indicates again why the Dirac matrix, K DW , preconditioned by the Pauli-Villars matrix 
plays a central role. 
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Using Wick's theorem on a general operator establishes the full set of identities. For 
example Wick's theorem, 

= [P*9*D DW (l)P] xl , yl = [V^D D l w (m)D DW (l)V} xltyl = D™ y l (m) , 
gives the propagator identity, 

(q x q y )DW = D™~ l {m) = (ip x ip y )ov • (5.7) 

A formal derivation of all Fermionic correlators continues by iteration or more systematically 
by introducing sources for the Fermion field as a generating function. 

In order to avoid confusion it is important to note that our anti-spinor, 

q x = [*D DW (1)V] X>S=1 , (5.8) 

in Eq. 5.2 differs from the traditional choice 

q x = [V(-D_)KV] x>s= i = [*(-D_)V% tS=Ls = (1 - m)- 1 [^( J D w (l) - D DW {m))V] x , s=1 , 

(5.9) 

or q x = (1 — m)q x + [^Dj)w(m)V] XtS =i. The traditional definition results in the awkward 
subtracted overlap quark propagator 

(qxq y )DW = 1 — fe^m) ~ l]xy ■ (5.10) 

Our new definition of q smears the boundary field by one lattice unit by virtue of the appli- 
cation of D DW (1) operator. We believe this is a better choice since it simplifies the identities 
between domain wall and overlap correlation functions, suppressing reference to the particular 
implementation be it Shamir, Mobius or some future variant. Moreover, an advantage of our 
definition of q x is that changing the normalization of the operator Dow( m ) has no effect on 
the correlation functions since this change cancels with L>^]y(l). 

For our discussion of the axial current, it is useful to extend into the bulk both def- 
initions of the anti-spinor, Q s = [^>D DW (1)T>] S and Q s = [^>(-D-)V ] ] s for all s with their 
respective correlation matrices. These definitions give the following bulk to bulk 5d correlators 

Af7(m) = (Q s Q sl ) = [pt -A — D DW '(I)?].,, , (5.11) 



given in Eq. 2.27 above, and 



ilV(m) = (Q s Q sl ) = [pt__L_, £>_)7>t] a)i , (5 _ 12) 

respectively. The evaluation of these matrix elements (or correlators) are given in Appendix |Aj 
The mass dependence enjoys the nice factorization property, M DW (m) = A DW (m)M DW '(1). 

4 In particular q— [^>TZ'P] S=1 for the Shamir action with D_ = — 1. 
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y z y,l 2,1 

Figure 9: The overlap current insertion in the quark propagator is equivalent to a local 5d 
current insertion in the domain wall bulk summed over the 5-th co-ordinate s. 

5.1 Domain Wall/Overlap correspondence for currents 

In the continuum the vector and axial current can be defined by Noether's theorem through the 
local change of variable in the path integral, 

4>{x) -»• exp[i6^0r)A a + i^(x)A a 7 5 ] ip(x) 

V>(x) -> i>(x) exp[-i9^(x)\ a + i6 a A (x)\ a j 5 ], (5.13) 

resulting in Ward identities for the divergence of local currents. The current is then identified 
after integration by parts. An equivalent method is to gauge the action with a flavor gauge 
field and define the current as the linear response to this gauge field. On the lattice the gauge 
approach is superior particularly for non-local actions such as the overlap action. For non-local 
actions, the analog of integration by parts (i.e. summation by parts 1 ), needed to identify the 
current in Noether's approach, is difficult to define. 

For both the singlet and non-singlet vector current the "gauging" of the action on the 
lattice action is accomplished by the substitution U^{x) — > exp[i\ a A'l 1 (x)]U^(x) on each link, 
where A®, (x) is an adjoint flavor gauge field on link (x, x + fi). Now by applying the Domain 
Wall/Overlap equivalence, we must have an equivalence between the matrix element for 
vector and axial currents. From 

- iS A°( x ) (^i)ov = -i$Aa(x) (q l y ql)Dw , (5.14) 

we obtain the elegant identity, 

(J^° v (x) ^ J z ) ov = (J^ DW (x) qit)DW , (5.15) 



28 



neglecting for the moment a contact term that only contributes at x = z. The quark flavor 
indices are labeled by i, j = 1, • • • , nj. The vector current operators resulting from variations of 
the action are 

= -i5 A a {x) ^ y D ov (m) y ^ z = ^\ a V»(x)^ , (5.16) 

and 

ji-)DW {x) = -i §A%{x) yD DW -V + ■■■ = *A a V M (x)vI> + • • • , (5.17) 

for the overlap and domain wall actions respectively. To derive this is straight forward resulting 
in 

D ov{ m ){5A*{x) D ov{rn))D^(m) = (q y ,x = s >)dw {(^a^(x)^ d dw^}dw {^x', s q z ')DW 

- {Qy ^A^(x)Qz)dw ■ (5.18) 

The last term on the RHS is the contact term due to the dependence of q = \fy D DW (m)V\i on the 
gauge fields. It only contributes to x = z, so it can be ignored except at co-incident points. In the 
Shamir case the local source q = [*&V^]i can be used to avoid this contact term but as described 
above it then introduces into the propagator a contact term and it is awkward to generalize 
this to Mobius Fermions. Again as a consistency check, it is worth noting that given the bulk 
to boundary propagators (q^ s ')DW = [P^^flUb' and {^ s q) DW = [Dp\ v (m)D DW (l)V] sl , 
discussed in the Appendix [A], this identity (5.18) is equivalent to the variational derivative of 
our fundamental equation (2.28), 

SA % ( x )D-\m) = S A a [x) [pt Dd ^, ) D dw (1)V]u, (5.19) 

including the contact term. This leads to an explicit vector current overlap kernel as a sum over 
the bulk domain wall modes, 

Vf g (x) = D°l,(m)(q y ^ y ^ s/ ) DW V^(x) ylfs/ ^ s (^ x ^ s q zl ) DW D°J z (m) . (5.20) 

Like the overlap propagator itself, this current is non-local in 4d falling off exponential in \y — x\ 



and \z — x\. The + • • • in Eq. 5.17 refer to the Pauli-Villars term which does not contribute to 
the Fermionic matrix element being consider here. However, as we will show later, there are 
occasions such as conserved current-current correlators, where the Pauli-Villars contribution is 
required. It follows from the identity, 

( 4 a)ov (x)4 b ^(y)) = S A a {x) 6 A$(y) log[Z ov ] = 5 A a {x) 5 A a {y) \og[Z DW ] , (5.21) 

that the correlator is 

(4 a)ov (x)4 b) ° V (y)) = Y,(f, DW (^ s )ju DW (y,s)+jf V (x,s)jf v (y,s)) , (5.22) 
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as explained in detail in Sec. [6j 

Denning the axial current directly from the overlap action is more difficult because 
the global axial symmetry, which is realized by 75 = 75(1 — D ov (0)), is non-local and depends on 
the background gauge field, U^x). Thus for the axial current we choose to define the overlap 
axial current by imposing the condition, 

(J B M™(x) 10 x ) m = (4 {a)DW (x) qlqi)DW ■ (5.23) 



6 Ward-Takahashi Identities 



We now proceed to give explicit expressions for the vector and axial current for the Mobius 
domain wall action. We follow closely the literature in particular the paper of Furman and 
Shamir jS]. Since the 5d domain action is a function of the 4d Wilson kernel, we begin by 
reviewing the Wilson vector current to establish notation and useful intermediate kernels. This 
vector kernel contributes to both the vector and axial current. 



Wilson Vector Current: The vector current for Wilson Fermions is well know but let us 
repeat the argument to establish the notation and methodology for the chiral Fermions. By 



gauging the Fermions action (2.13) , 

gWUsonp^ = ^D Wihon [U li {x) i M 5 ]9 , (6.1) 
we compute the first order variation to get the vector current, 

W = -^(^^^[^(^^^^^-(a,)^ = *zV*,(x)9 v , (6.2) 
where the ultra local kernel is 

V£,(x) = ^^U^S^S^y + 7t±lul(x)S Z)X+tl S x>y . (6.3) 

Next we consider the linear responses to change of integration variables, *$> x — > exp[i\ a 8®]^ x , 
^ x — > ^> x exp[i\ a 9 x J ] in the path integral and applying Neother's theorem to get a divergence 
condition of this same current. The change in the action is 



JJ(x)-JJ(x-m) = A_^(x). " (6.4) 
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By Noether theorem we have the lattice current conservation condition, H 

52 Jtfx) - ^(x - ti) = **A a -=- - q^^x = , (6.5) 

using the classical equations of motion. Actually the divergence condition or Ward-Takahashi 
identity is a quantum constraint inside the path integral, 

(A^J-(x) O) = (6 e «0) , (6.6) 

which is often given the short hand notation by asserting that, A_^J^(x) = 0, is an "operator" 
equation. If the action or the measure is not invariant additional terms need to be included. 



6.1 Domain Wall Vector Current 

The derivation of the vector current for the domain wall action parallels the Wilson case closely. 
Again we gauge the action and take the variation, 

J^ DW {x) = -i8 A a^ xs D D w(rn) ys , zs \Up\$ zs > - i8 A a{ x )$ xs D D w{l)vs,zs'\Uv\<& )l8 > • ( 6 - 7 ) 
For the Shamir action the result is 

L a 

jf W {*) = ^[^sX^ix^s + $ s \ a Y»(x)$ s ] , (6.8) 

s=l 

expressed as the Wilson current averaged over all "flavors" in the 5-th direction. The second 
term involves the Pauli-Villars fields, which give zero contribution to quark correlators. The 
generalization to the Mobius vector current is straight forward. Taking the variation of the 



domain wall action in Eq. 2.19 we obtain 



L s L s L s 1 

Jf w (x) = b 5 J2^sX a Y^x)^ s + c 5 J2^sX a Y^(x)P + ^ s ^ + c 5 J2 *s\ a ^(x)P_* s+1 

s=l s=2 s=l 

- c 5 m(^ 1 X a Y^{x)P + ^ Ls +^ Ls X a Y^(x)P^ 1 ) + (tf,*,m -►$,¥, 1) . (6.9) 

The Pauli-Villars contribution is given exactly by the substitution indicated above. It may seem 
surprising that the domain wall vector current operator for Mobius Fermions have explicit bare 



mass dependence but the identity (5.15) with the overlap form requires this. Indeed, in view of 



Of a course "local" currents like gauge variable U ll {x), on the lattice, J/j.(x) = J(x,x + fj,), are really bilocal 
variables assigned to a positive links, (x, x + fx). The current in the negative direction carries the opposite sign: 
J(x + fi, x) = —Jp(x). 
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the form of the overlap action, Sf = (1 — m)ip[m/(l — m) + D ov (0)]ip we see that the effective 
overlap vector current is independent mass except for the overall factor of (1 — m) which as we 
mentioned before could be corrected trivially by renormalizing the field by \/l — m and thus 
removing all mass dependence from the vector current. 

An alternative approach to the 4d vector current is to begin with constructing a local 
5d conserved vector current, 

A^jf w (x, s) + A_^ DW (x, s) = 0. (6.10) 
This is accomplished in the same manner as before except now the domain wall action is gauged 



6.9 



by a flavor potential, A^(x,s), depending on the fifth dimension s. The 4d current in Eq. 
is formed by summing over the 5-th axis 

^(*) = £j2 dh W). ( 6 - n ) 

s 

and since no current leaks out of the 5-th axis, 

^^jf w (x,s) = 0, (6.12) 

s 

it is conserved as a 4d vector current. 

To get the full expression for the 5d vector for Mobius Fermions, we need to take care 
of off-diagonal terms D_P± by gauging, 

The resulting 5d "vector" current j%j(x, s) = (x, s) + j%f v (x, s) is 

jf W (x, s) = b^ s X a Y^x)^ s + c^ s X a Y^(x)P^ s+1 + c^ s \ a Y^(x)P + ^ s - 1 + 

- c 5 (l + m)^ 1 X a Y tl (x)P + ^ La 6 S>1 + 

- c 5 (l + m)* Ls \ a Y»(x)P^ 1 5 s , Ls (6.14) 

and 

jf w (x,s) = V x>s \ a D^P^ XtS+1 -y XtS+1 \ a D_P + y :E>s + 
- (1 + m)* XtLs \ a D_P^ xA 5 s , Ls + 

+ (l + m)* xA \ a D_P + * x , Ls 5 s , Ls . (6.15) 

where s + 1 is considered modulo L s . Again the Pauli-Villars current jff v (x, s) takes the same 
form with substitution ^f, ^ — > <£>,<!> and m — > 1. 
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6.2 Domain Wall Axial current 



The subtlety for the axial current for the domain wall action is that in 5d there is no chirality. 
The only local current is the 5d vector current defined above. The analog for 4d chirality is the 
parity transformation that reflects the 5-th axis exchanging the two domain walls. This is the 
essence of the descent relations [3] that inspired Kaplan's use of the domain construction in the 
first place. 

To implement this on the lattice, one simply splits the extra dimension on any link 
(M, M + 1) between the two domain walls (1 < M < L s — 1]) and defines local vector currents 
on each side. By gauging each side separately we get a Left current for s S [ljAf] and Right 
current for s G [M + 1, L s ]. The 4d Ward identities follows from Gauss's law applied to an open 
interval s £ [sq,si] where the outgoing links (sq — l,so) an d (si,«i + 1) on the boundary are 
dropped - parallel Dirichlet branes if you like. Now flux conservation requires including the flux 
through these boundaries, 

81 

A_ M j»(x, s) = j 5 a (x, s - 1) - j 5 a (x, Sl ) . (6.16) 

s=so 

Although the definition of the axial current can be done using an arbitrary point in the 5 th 
dimension to split the left handed current from the right handed current, we will restrict ourselves 
to the case where M is taken to be L s /2. Hence, we define the axial (and vector) currents by 
separating the left and right terms, 

La/2 L s 

= and J£(x)= J2 ( 6 - 17 ) 

s=l s=L s /2+l 

With periodic boundary condition Jg (x, L s ) — Jg(x, 0) = 0, we have the conserved vector current, 

L s 

Jf w {x) = J£(x) + Jg(s) = Y.3>i s ) ■ ( 6 - 18 ) 

s=l 

The odd parity axial current is found by subtraction, 

L a 

j£ti(x) = - j&x) = E r s( s - vw«) - ( 6 - 19 ) 

8=1 

where T$(s — L s /2) = e(L s /2 + 1/2 — s). By Gauss' law we obtain the Ward-Takahashi identity, 

A_ M jg^(x) = -2jl(x, L s ) + 2j|(x, L s /2) , (6.20) 
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where 



jf(x,L s ) = -mq x \ a <y 5 q x - <$> x , Ls 75^ a $x,l 
jl(x,L s /2) = Q x ,L 3 /2l^ a Q X ,L s /2+X + ^x,L B /2l5^x,L B /2+l ■ (6-21) 

6.3 Overlap Axial Current 

Similarly we can write down the kernel for the overlap axial current, 

J^(x) = D ov (m)(q^)J { ^(x)(^q)D ov (m) . (6.22) 

The remaining task is to use the map between domain wall to overlap Fermions, 

(J*W°°{x) %W z ) ov = (J^ DW (x) q y q\) DW + i(Q l y S A a (x) qi) DW , (6.23) 

to express the divergence in terms of overlap fields. Evaluating 

(A_ M jW"(x) if> y ^ z )ov = ((2mq x \ a ~f 5 q x + 2Q x , s j 5 \ a Q x , s+1 ) q y q z ) DW , (6.24) 

where s = L s /2, implies, 

A-^(x) = m^ x [( l5 + %)x/j] x + 2(1 - m)^ 7 5pf>)^ , (6.25) 

as we may readily check. Using the identity: 275 = (1 — m)(l + 7575) + 2D ov (m) the first term 
in Eq. 6.24| gives, 



[D~\m) - l] yx j 5 [D^(m)] xz = m[D~^ (m)W( 75 + ls)D^(m)] xz + CT , (6.26) 



(1 — m) 

where the contact terms (CT) are easily identified as 

CT = -77— — ~<[ 5 yxls D ™{ m )xz ~ D^(m) yx ^5 x ^ z } . (6.27) 
(1 — m) 

One may absorb these terms in an appropriate introduction of contact terms in the map from 



domain wall to axial currents (Eq. |6.23 ) analogous to those found for the vector current but the 
procedure is rather arbitrary and unphysical. The second term in Eq. |6.24 is 



(qQ x , s +lh 5 (Q x , s q) = (1 - m) J D- 1 (m) 75 Af' +1 Af D^{m) , (6.28) 



where we define the Left and Right breaking term by 



Tr 1 ...^ 1 ,^, T7\ n ...TT l 



a^a« = t L ;: T -t +1 n \ sl HX" r y - , (6-29) 
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whose space-time average Y^xPl s ( x ) = ^L 3 is precisely the correct breaking term for global 



chiral symmetry found earlier for the overlap action (2.9). In addition note that due to the 
conservation of 5d flux, the result is independent of the location (s = L s /2) of the mid 
plane slice. Changing the position merely redefines the axial current by a term with zero total 
divergence. Nonetheless the natural definition is to take the mid-plane so that parity is equivalent 
to the reflection operator, 1Z : s — > L s — s. 

So far we have dealt with the non-singlet sector for the currents. This singlet sector 
requires some special considerations. In the vector channel one wishes to introduce finite chem- 
ical potential for the Baryon number. In the axial channel one needs to show how the axial 
anomaly arises in the domain wall formalism. In both cases a natural representation in our 
domain wall/overlap correspondence exists. 



6.4 Axial anomaly 

The flavor singlet domain wall axial current is anomalous, as has been shown by Kikukawa 
and Nuguchi [26J. The way this comes about is instructive. This has been computed from the 
self-contraction of the Fermionic fields, Q and Q, in the bilinear term Ql s /2( x )15Ql s /2+i( x ) of 
Eq. |6.24| at the mid-plane of the fifth dimension. This term gives rise both to a contribution 
to m res (through contraction with the boundary fields) and to a new term that survives in 
the L s — > oo limit. In order to rigorously perform the calculation, one needs to use the map 
between domain wall and overlap currents that requires both the domain wall Fermions and the 
Pauli-Villars "bosons". The result is that this Fermion contribution at the mid-plane is exactly 
canceled by the Pauli-Villars contraction and replaced by a boundary contribution at the domain 
wall. This is really the correct way to understand the physics. For example suppose we modified 
the domain wall implementation as for example suggest in Ref. [39] by allowing slightly non- 
uniform gluon fields, X j S J , CIS cl function of the fifth coordinate. Without the Pauli-Villars 
cancellation, the chiral quarks at the boundary would "feel" the wrong gauge potential at the 
mid-plane giving unphysical and indeed incorrect contributions. This mismatch is cured by the 
correct calculation. 

The calculation proceeds as follows, 

(^ ji Dw( x )) = 2m(q x -/ 5 q x ) + 2(Q XtLa/2 >y 5 Q XtLs/2+1 } 

- 2(^,^75^,1) + 2<$ X , W27 5<F E , W2+1 ) . (6.30) 

The first term is the usual quark mass contribution. The second gives two contributions using 
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the identity 



- (Qx,L s /2l5Q x ,L s /2+i) = Tr[j5A L3 / 2+ljS (rn)M StLs/2 (l)]x,x (6.31) 

= (1 - m)rr[Af s/2+1J D - 1 (m) 75 Ai s/2 ] x , :[ . + Tr[^M La/2+1;Ls/2 (l)] XjX . 

where Tr[- ■ ■] traces only over color and spin with fixed space time point x. This contributes 
both a term corresponding the chiral violation for m res ^ and a new term which exactly 
cancels with the mid-term contribution of the Pauli-Villars fields, 

- (* as ,L./a76*x,r./2+i> = -Trh 5 M Ls/2+hLs/2 (l)] x , x . (6.32) 

The change in sign is due to Bose versus Fermi statistics. Instead now the anomaly comes for 
the boundary Pauli-Villars term, 

2(^,^75^,1) = lim -^—Tr[>K{rr-\m) - 1)} X , X = 2Tr[ 75 (l - ^(0))]^ . (6.33) 
m— >l 1 — m 

In the limit of L s —> oo, summing over the toroidal volume this gives the lattice Atya-Singer 
index for the instanton number, (1/2) ^ x Tr[j5D ov (0)] XjX = n + — n_. In the continuum limit, 



Eq. 6.33 gives the topological charge density (1/327T )Tr[F tl ^(x)F ti ^(x)] as required [2]. The 



Pauli-Villars term is doing its job by canceling all the heavy cut-off modes in the interior. 



6.5 Residual Chiral Violations 

At fixed values of L s the "residual mass" is a common criterion to measure the magnitude of 
chiral symmetry violations. This is defined in Ref. [34, 36J by the correlator 



/.n _ EgO'sK t, Lg/2) j 5 (0, t, L 9 )) c _ T,5-(Qx,a5Q x ,t go75go)c . s 

in the plateau region with t away from the source and sink. Note however in our definition 
of m res , we are using our anti-spinors q in the denominator that removes the unwanted factor 
(1 — m) 2 , which of course is irrelevant to the chiral limit (m = 0). The fields Qg^ = Q x ,l s /2i 
Qx,t = Qx,l s /2+i are the domain wall fields at the mid-plane link: (L s /2,L s /2 + 1). The 
restriction to connected contributions is a consequence of defining the residual mass via non- 
singlet pseudoscalar sources which have no disconnected diagram. 

Computing the contractions for the connected diagram and using the identities in 
Sec. A. 2 proven in Appendix [A] for the two point correlators, 



l 5 (qoQxh 5 (Qx go) = Dl- l (0,y)pl z (x)D- l (z,0) 
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we have 

(t) = ExTr[(q Q x ) 75 (Q x go) 75 ] = ExTr^DjPl 1 } ^ (0|g75P(0)<zK) 

(6.35) 

where, 

p z L y s (x) = A^ y , (6.36) 



is defined in Eq. 6.29 The expression on the right in terms of the pion to vacuum matrix element 



holds for large t separating the source and the sink. 

A perhaps less practical but more elegant definition of the residual mass is to sum 
over all time slices, 

_ Eg,t(Q$,tJ5Qg,t <7075<?0>c 

m res = — — , (6.37) 

resulting in a form, 

_ TrlAL^H^D^Dtv 1 } ^ 

= — — 9 — 7 -n = > w n (X) A Ls (A) = (A Ls ) n , (6.38) 

"-i nt -1 



Tr 



ov ^ou 



better suited to theoretical analysis. Now the trace includes the sum over the spatial index as 
well. On the right we introduced the spectral weight for the pion correlator, 

(x\p^D^\X) 

In the limit m — > 0, the improved definition m res is the normalized trace of the violation of the 
Ginsparg- Wilson relation written as 

D^(0)A Ls (H 5 )D-\0) = + 75^(0)75 - 2)/2 . (6.39) 

For the polar decomposition the operator Al s is positive definite because the approximation 
always underestimates the sign function: |e£ s (a;)| < 1. Consequently, zero residual mass (m res = 
0) implies the exact Ginsparg- Wilson relations and unbroken Ward-Takahashi relations. 



Using this identity (6.38), it is easy to model the residual mass with a reasonable 
approximation to the spectral density. This model captures well the trends seen in our numerical 
results in Sec. [4| For the spectral density, w(X), we note that it is plausible that it will have 
negligible dependences on L s and a parameterized in terms of unsealed eigenvalues of H§ = 
-y 5 D Shamir (M 5 ) so that 

m res ~ W *( X ) A L S («A) . (6.40) 



A 
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A simple model for w n (X) is to approximate a few smallest eigenvalues by a finite density p(0) 
due to "small topological defects" and at larger eigenvalues by the free kernel. Indeed this ansatz 
is able to give a good fit to our empirical study of the parameter dependence of m res in Sec. |4j 

At large L s the defect dominate the contribution to ui res . Using the expression 
€£ J3 (aX) = tanh(x) with x = L s log(l + qA) — -L<jlog(l — aA)) we get the expression for the 
error 

A L >A) = l/(4cosh 2 (x)) -> e~ L '\ + aA ) " lo §( 1 " aA )l , (6.41) 

for 0{L~ l ) < \a\\ < 0(L S ). Outside this window the error is 0(1). For large L s the error is 
dominated by the small eigenvalues in the interval |A| < l/(aL s ). For the standard Shamir form 
(a = 1) this causes the residual to fall like m res ~ p(0)/L s asymptotically but for the rescaled 
Mobius with a ~ L s /X max we estimate the residual to fall like m res ~ p{Q)/L 2 s asymptotically^} 
This improvement is, we believe, the basic explanation for the superior chirality of the Mobius 
algorithm. 

As we mentioned above, the polar approximation to the sign function results in a 
positive residual mass. However, other polynomial approximations, such as Zolotarev, of the 
sign function may oscillate around e(x) so positivity is lost. In fact one can even "tune" m res to 
zero but this does not imply that chiral symmetry is exact. Instead it implies that the lowest 
order dimension 3 operator in to the chiral Lagrangian is given by the quark mass. The real 
issue is higher order terms, in particular the dimension 5 chiral symmetry breaking operators. 

6.6 Baryon current and Chemical Potential 

The chemical potential couples to the singlet charge of the vector current, i.e. Baryon number. 
In this section we point out the relation of the vector current we already defined to the non- 
zero chemical potential formulation of the overlap and domain wall Fermions. It is useful to 
note that the same strategy can be used to define currents for kernels that violate 75 Hermiticity 
leading to complex determinants such as the Dirac operator with a chemical potential. Block and 
Wettig |45] have given the overlap operator for non-zero chemical potential. Their construction 
is tricky because of the need to define the "sign function" for a non-Hermitian kernel "e(A)" , 

6 Strictly speaking one should be careful about the order of limits. The eigenvalue distribution is only properly 
given by a density function, p(A), in the limit of infinite lattice volume (I/ 4 ), so here at finite volume p(0) should 
be replaced by a measure of the mean number of eigenvalues in the interval near A = where the exponential 
approximation to e(A) fails. At fixed volume, L 4 , exponential convergence to m res — is expected to resume for 
L s ^> L but this is of little practical importance. 
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where A = j 5 D Wilson (M 5 , fj,) is given by making the standard substitution for non-zero chemical 
potential, 

(1 ± >n)U±i(x) -> (1 ± 7i )e^U ± ^x) (6.42) 
into the Wilson operators for all time like links. Their rule is to define the "sign function" by 

u e(A)" = Ssign(Re A)S- 1 , (6.43) 

where S is the similarity transformation that diagonalizes the kernel A = SAS^ 1 . 

It is straight forward to rederive their prescription and generalize it for any kernel at 
finite L s by simply inserting this Wilson operator (for non-zero ti) into our generalized domain 
wall formalism. In the domain wall form we get, 

T~ La - 1 (1 + A) La - (1 - A) L ° (1 + A) Ls - (1 - A) Ls ! 
eL.K A ) t -l s + 1 (1 + A) Ls + (1 — A) Ls (l + A)^ + (1-A) L ' ' { ' 

where we diagonalize A = SAS~ 1 with eigenvalues A = x + iy. As shown in Appendix |aJ 
A = (T- 1 + l)" 1 ^- 1 - 1) = l5 (D + + D_)/[ 75 (D + - L>_) 75 ] or A = 7 5^(^) for Borici. But 
using |(1 ± A) Ls | = |1 ± A| Ls or |(1 ±x±iy) Ls \ = [(1 ±x) 2 + y 2 ] La l 2 we see that the exponentially 
dominant of the two terms for each eigenvalue is the one where the sign of ±x is positive. Hence 
we get 

e Ls (A) 5sign(i?eA)^ 1 , (6.45) 
in exact agreement with Block and Wettig at L s = oo. 
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7 Conclusions 



In this paper we have reviewed the Mobius class of chiral domain wall Fermion operators [17\ [18] . 
Since they are just now coming into wider use in production codes, a general presentation of the 
formalism is perhaps warranted. We have sought to emphasize several features critical to their 
performance and chiral properties. On the performance side, we note that the Mobius kernel 
operator requires no additional applications of the 4-d Wilson Dirac kernel per domain wall 
iteration. However we are required to replace the conventional 5-d red-black preconditioning 
by a 4-d checkerboard with constant color in the 5-th axis in order to avoid a 4-d Wilson 
operator inverse as a new inner loop. In fact such 4-d even/odd decomposition is needed for 
the earlier Borigi domain wall formulation |27[ [28], as well as any operator that goes beyond 
nearest neighbor terms in the 5-th direction. A highly optimized parallel Mobius Domain Wall 
Fermion (MDWF) inverter for clusters and the Blue Gene architecture has been freely available 
for several years [21] as well as Hybrid Monte Carlo evolution code in Chroma [35]. Very soon 
the Mobius inverters will be available in the QUDA g7] (QCD in CUDA) library for NVIDIA 
multi-GPUs platforms as well as a full Hybrid Monte Carlo evolution code in the Columbia 
Physics System [58] (CPS) for Mobius fermions optimized for the BlueGene/Q. 

In addition, in this paper we worked out the form of the conserved and partially 
conserved axial vector currents for Mobius domain wall fermions, as well as their Ward-Takahashi 
identities. The goal was to have a general approach that maps the domain wall expressions at 
finite L s into their equivalent form for the effective 4-d overlap action. A byproduct of our 
formalism is a simple derivation of the overlap operator at finite chemical potential, a result 
first obtained by Block and Wettig [45J. Also we show how the general expression for the 
residual mass, in the case of non-vanishing zero mode density, implies quadratic convergence, 
m res = 0(1/ Lif) , for the appropriately scaled Mobius fermions for large L s in contrast to the 
slower linear convergence, m res = 0(1/ L s ), for Shamir. This largely explains the reason for 
the improved chiral behavior of the Mobius rescaling algorithm. Indeed, for the test ensemble 
we used, our numerical tests support this picture and even suggest that Mobius at L s = 32 
should correspond roughly to running Shamir with L s = O(10 3 ). Of course the latter is neither 
practical or even easily amenable to direct numerical verification. 

Perhaps of more interest is to consider using the Mobius algorithm to reduce L s sub- 
stantially. For new HMC runs, one can be even more aggressive. For example as illustrated 



in Fig. 10, the typical simulation with Shamir at L s = 16 might be run with Mobius at L s as 
low as L s = 4 with a tolerable compromise on chirality. Perhaps not all simulations can be this 
aggressive but there may be some that can. For example exploratory investigations for Beyond 
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Figure 10: Comparison the approximate of the sign function e^ s (aA) plotted against A for 
Shamir (a = 1) at L s = 16 with rescaled Mobius (a = 2) at L s = 8 (indistinguishable) and 
the lower curve scaled Mobius (a = 4) at L s = 4 . The support for the kernel is bounded by 
eigenvalues |A| < (8 — Ms)/(10 — M5) ~ 0.75 so the visible degradation at the top is limited and 
should not be significant for the chiral physics at small eigenvalue. 



the Standard Model (BSM) physics might try this at least in early broad surveys of the gauge 
theory landscape. 

We are also aware that as the Mobius algorithm comes into more common use, there are 
potentially opportunities to combine it with other algorithmic methods such as Hasenbusch [39] 
mass precondition or multigrid preconditioning |23t [22] . For the latter it is interesting to note 
that the first level of domain wall multigrid maps the 5-d operator into an effective 4-d coarse 
operator [24J suggesting many avenues of investigations that could allow domain wall codes 
to compete favorably with 4-d Wilson codes. One example of a hybrid algorithm that has 
received some preliminary investigation is combing the Mobius algorithm with the idea of Gap 



Fermions [50j to further reduce the value of m res at fixed L s as noted in Fig. 11 reproduced from 
Ref. |19] . The figure shows that the smoothing consequences of introducing a Gapped action can 
further reduce the residual mass by as much as an order of magnitude for L s = 16 for example. 

The interaction between different algorithmic methods opens up a large range of pos- 
sible improvements worth of serious study. The RBC collaboration has been pursuing similar 
strategies |51j . to try to balance the apparently conflicting desire for better chirality while still 
facilitating sufficient thermalization of topological charge sectors. We have no magic solution 
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Figure 11: The Mobius algorithm (a = 2) on pure gauge and Gap lattices vs Shamir (a = 1). 



to this but of course the interaction of Mobius fermions with other algorithmic approaches begs 
further explorations in this endeavor. 
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A Mobius generalization of Domain Wall Operator 



This appendix collects together the basic identities that relate our generalized domain wall 
operator at finite L s to the effective operators in the 4d overlap world. The class of domain wall 
operator (including our Mobius and Zolotarev cases) has next to nearest neighbor interaction in 
the fifth axis, 



D DW (m) StS 



s'+l 



mD W P + 6 Sj1 5 s , jLs -mD {Ls) P_ 5 S>L J S , A , 

W 



with s, s' = 1,2, ■ ■ ■ L s mod L s cyclic moduls L s and P± = !(l±7s) and = b§(s)D 
1, D 



\Wilson 



(A.l) 

(M 5 )+ 



(s) 



c 5 (s)D Wilson (M 5 ) - 1. The Wilson operator is 

1 
2 

In matrix notation the domain wall operator is 



DY y llson (M 5 ) = (4 + M B )6 x>y -Ul- 7,)^WWs + (! + 7a,)0J(i/) Ww* 



(A.2) 



D DW (m) 







-mD {Ls) P_ 



D®P + 



D 





[2; 

(3) 



D (2) p _ 





We will also on occasion discuss the equally valid "left" form 

D DW (m) = DZ 1 D DW (m)D_ , 



-mD W P^ 





D 



(A.3) 



(A.4) 



Diag[D^,D^,D^,..., 


D^ s ^\, which rotates the chiral projectors 


to the left, 




r d« 


p_ D (2) 


• • • -mP + D {Ls) ~ 










p_D {3) ■■■ 




D DW (m) = 





P + rt 2) 


••• 


(A.5) 




-mP-D W 





• • • z?l Ls) 





because of its possible advantage for efficient code. 



A.l LDU decomposition 



To relate the 5d domain wall matrix to the 4 dimensional overlap form, one merely performs a 



standard LDU decomposition of Eq. A.3, although the notation in the domain wall literature 
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(which we adhere to here) is unconventional and obscures this a bit. Consider the decomposition, 

D DW V = "UDL" . (A.6) 
The interchange of U and L relative to conventions of mathematics texts is inconsequential since 



reflecting the 5-th axis with transformation, 1Z defined in Eq. 2.23 converts any U matrix to an 
L matrix and vice versa. 



The first step is to multiply by the permutation (or pivot) matrix that performs a left 
shift C for positive chirality components: 



form: 
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P+ 

















p_ 


p + ... 







V = P-I + P+C = 








p_ ... 





(A.7) 
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gives an upper diagonal 
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(A.8) 
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or, 



D DW (m)V = 7 5 Q_ 








-Tf 1 
1 










(A.9) 



,Q^ s) \. Here we define 



after factoring out the the diagonal matrix, Q_ = Diag[Q^\ Q_ . 
the mass dependent constants, c± = P± — mP T = |(1 — m) ± 1(1 + rnjjs, and the 4d local 
"transfer matrix", Tr s+l s i = T s , from s to s+1 on the [s + 1, s] or in fact its inverse, 



[s,s+l] 



p 



-1 



(A.10) 



This operator is Hermitian and relates to a 5d s dependent Hamiltonian operator by the identity, 

T7 1 - 1 1 



Pf, 



+ 1 0% - Qi 



-(Q s + + Qi.)=j 5 



D (S) + £,(_-) 



D (s) _ D ( S ) 



(A.ll) 
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(A.12) 



Thus we have an s dependent 1 version of the Mobius operators given in Eq. 2.14 

(b 5 (s) + c 5 (s))D WUson (M 5 ) 

8 l5 2 + (b 5 ( S )-C 5 (s))DWUson {M5) ■ 

The remaining two steps are Gaussian elimination with "U" and back substitution with "L" , to 
obtain Q 

D DW {m) V = 75 Q- UD 5 (m) L(m) , 



(A.13) 



in terms of 
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1 -Tf 

1 







L(m) 



-1 

-%i]C+ 1 

-%i] c+ 1 

-T[Ls,i]C+ 



(A.14) 



where 2} s ji = T s 1 T S+ L 1 • • -Tl s , assuming periodic index notation 1 = modL s + 1. For future 
reference this notation is generalized to s-ordered products ( 1 < s < s' < L s ), 
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I [s,s'+l] - J s J-s+l ■ ■ ■ J^gi 
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We also introduced the matrix: D^{m) = Diag[D±(m), 1, • • • ,1], where D^m) = T Ls c+ — c_, 
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The inverses are L 1 (m) = L(m) and 





1 


T [l,2] 


T [l,3] • 


• T [1,L S 







1 


T [2,3] • 


" T %L S 


u- x = 








1 


" T [3,L S 













1 



(A.18) 



The only non-trivial diagonal element in D§(m) can be factored 

-I/.n^ / x 1 + 771 1 — 771 



D4m)Efl 4 - I (l)fl 4 ( m ) 



+ 



756LJH], 



(A.19) 



7 Again we point out that the notation is a little unconventional relative to the mathematics literature, since 
we should really identify the "U" matrix as 75 Q_ U QZ 1 "fs, absorbing 75 Q_ into the diagonal matrix. 
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where 



1" 



1 



^M^j^Ti' (A - 20) 

and H = (1— T)/(l+T). Gamma 5 Hermiticity requires that H and therefore that T is Hermitian 
and therefore, T X T 2 T 3 ■ ■ ■ T Ls = T Ls ■ ■ ■ T^T 2 T X . 



It is now straight forward to compute the matrix in Eq. |2.27 
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D DW (± 
and its inverse in Eq. |2,28[ 
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Finally we have the crucial identity that relates the 5d and 4d determinants 

Det[D DW -\l)D DW (m)} = Det[D ov (m)] , 



(A.21) 



(A.22) 



(A.23) 



(A.24) 



because the j^Q-U factor cancels in the product and — Det[L(l)] = —Det[L(m)] = 1. This 
completes the proof that the overlap measure is equivalent to the domain wall Fermion measure 
with Pauli-Villars pseudo- Fermion field to give the factor Det[D DW ~ 1 (l)]. 

A. 2 Domain Wall Correlators 



The general bulk to bulk propagator, Eq. 2.28 



i^s^s) = D D l w {m) s , s 



(A.25) 



is the domain wall inverse itself. For the Ward-Takahashi identities we need also to have bound- 
ary to bulk propagators to interior points in the domain wall. One fundamental set is 



[V^D- 1 



DW 
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[D- l w (m)D DW (l)Vy 



(A.26) 
(A.27) 
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In addition we need bulk to bulk propagators between quark spinors 

Q s = [P^] s = P_* s + P + ^ s -! (A.28) 
and two varieties of anti-quark spinors 
Q s = \*D DW (l)P] a 

Q s = [y(-D^)V% = y s (-D^)P- + y s+1 (-D^ +1) )P + . (A.29) 
Thus we introduce the two correlators. The first, 

A™(m) = (Q S Q S ,) = [pt DD l {m) D DW {l)V]s,s> , (A.30) 
is the fundamental formula derived above. The second "mass" term, 

M s ,Am) = (Q S Q S ,) = [pt _L_ ( _ £> _ )7? t]^ / j (A.31) 

is needed for the axial. There is also a very convenient factorization formula, M(m) = A DW (m)M(l), 
for this. 

For the flavor chiral ward identity two correlators are needed 

A™ = (Q s q) = (1 - m)X s ^D- ov \m) = (1 - m)T; 1 • • • f - ^ D^\m) 

= (l-m)AfD- v 1 (m). (A.32) 

The second one is 

Mi, s = (qQ s ) = ^-\m) 1 + 1 J _ Ls [rf 1 • • • T-^s = J D- 1 (m) 75 Af 75 . (A.33) 

This follows from the reflection property T.T 1 = Tj~^ +1 _ s , the identity q = (1— m)q+[^>Dow{ m )P]i, 
which implies 

M s ,i = (Q s q) = A? D" 1 (m) + <y a ,i , (A.34) 
and the fact that TZDZ 1 D DW (m) is 75 Hermitian, so that (gQ s ) = ^'R.ss'iQs'ffljs- 

Finally for the chiral anomaly in the singlet current, we need the correlator 

M ss ,{m) = (Q S Q S ,) = A™{m)M s „, s ,{l) = (1 - m)^D^{m)M ltS ,{l) + M s , s ,(l) 

= (l-m)AfZ) - 1 (m) 75 Af; 7 5 + M SiS ,(l), (A.35) 

for the anomaly with s = L s /2 + 1, s' = L s /2. 
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